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Abstract. We explore the mass-to-light ratio in galaxy clusters and its relation to the cluster mass. We study the relations among 
the optical luminosity (L,,,,), the cluster mass (M200) and the number of cluster galaxies within ^200 (A'gn/) in a sample of 217 
galaxy clusters with confirmed 3D overdensity. We correct for projection effects, by determining the galaxy surface number 
density profile in our cluster sample. This is best fitted by a cored King profile in low and intermediate mass systems. The core 
radius decreases with cluster mass, and, for the highest mass clusters, the profile is better represented by a generalized King 
profile or a cuspy Navarro, Frenk & White profile. We find a very tight proportionality between L„p and Ngai, which, in turn, 
links the cluster mass-to-light ratio to the Halo Occupation Distribution Ng„i vs. M2m- After correcting for projection effects, 
the slope of the L^p - M200 and Ngai - M200 relations is found to be 0.92 ± 0.03, close, but still significantly less than unity. We 
show that the non-linearity of these relations cannot be explained by variations of the galaxy luminosity distributions and of the 
galaxy M/L with the cluster mass. We suggest that the nonlinear relation between number of galaxies and cluster mass reflects 
an underlying nonlinear relation between number of subhaloes and halo mass. 



1. Introduction 

Clusters of galaxies are the most massive gravitationally bound 
systems in the universe. The cluster mass function and its evo- 
lution provide constraints on the evolution of large-scale struc- 
ture and important cosmological parameters such as Q.,,, and 
erg. Cluster mass-to-light ratios (M/L hereafter) provide one of 
the most robust determination of Q„, in connection with the ob- 
served luminosity density in the Universe via the Oort (1958) 
method. In this method, a fundamental assumption is that the 
average M/L of clusters is a fair representation of the univer- 
sal value. For this reason, many works have focussed on the 
dependence of the cluster M/L on the mass of the systems. In 
general, M/L has been found to increase with the cluster mass. 
Assuming a power-law relation M/L oc M", and adopting the 
usual scaling relations between mass and X-ray temperature or 
velocity dispersion, when needed, most authors have found a 
in the range 0.2-0.4, in both optical and near-infrared bands, 
and over a large mass range (Adami et al. 1998a; Bahcall & 
Comerford 2002; Girardi et al. 2002; Lin et al. 2003, 2004; 
Rines et al. 2004; Ramella et al. 2004; see however Kochanek 
et al. 2003 for a discordant result). Why does the cluster M/L 
increase with the mass? Based on the results of numerical sim- 
ulations, Bahcall & Comerford (2002) have proposed that the 
trend of M/L with mass is caused by the stellar populations of 
galaxies in more massive systems being older than the stellar 
populations of galaxies in less massive systems. In this sce- 



nario, the slope of the M/L - M relation should be steeper 
in the B and V bands, dominated by the young stellar popu- 
lations, than at longer wavelengths, eventually becoming flat in 
the infrared K band, dominated by the light of the old stellar 
population. Such a scenario is not consistent with the results 
of the semi-analytical modeling of KaufFmann et al. (1999), 
where the M/L is predicted to increase with mass with approx- 
imately the same slope in the B and / band. Also observation- 
ally, the slope of the M/L - M relation is found to be the same 
in different bands, the Z?-band (Girardi et al. 2002) the V-band 
(Bahcall & Comerford 2002), the R-bwA (Adami et al. 1998a, 
Popesso et al. 2005b,2005c) and the K-hwd (Lin et al. 2003, 
2004; Rines et al. 2004; Ramella et al. 2004). An alternative 
interpretation of the increasing M/L with system mass is pro- 
vided by Springel & Hernquist (2003). They analyze the star 
formation efficiency within halos extracted from cosmological 
simulations, with masses in the range 10*^ - IO'^/Wq, and find 
that the integrated star formation efficiency decreases with in- 
creasing halo mass by a factor 5-10 over the cluster mass range. 
This scenario is investigated by Lin et al. (2003), who con- 
vert the 2MASS /T-band cluster luminosities into cluster stellar 
masses. They find that the fraction of mass in stars is a decreas- 
ing function of the cluster mass {Msiar/M,,,, oc M/^l}^^). 

In this paper we address the above issues by studying M/L 
for a sample of 217 clusters, which span the entire cluster 
mass range. In particular, we study the relations among the 
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cluster optical luminosity L^p, the mass M200, and the num- 
ber of cluster galaxies Ngai, within the virial radius r2oo- We 
find a very tight relation between L„p and Ngah which links 
the Lop - M200 relation (and therefore, the cluster M/L), to the 
Halo Occupation Distribution (HOD hereafter) Ngai - M2oo- 
The HOD is a powerful tool for describing galaxy bias and 
modelling galaxy clustering (e.g. Ma & Fry 2000; Peacock 
& smith 2000; Seljak 2000; Scoccimarro et al. 2001; Berlind 
& Weinberg 2002). It characterizes the bias between galaxies 
and mass in terms of the probability distribution P(N|M) that a 
halo of virial mass M contains N galaxies of a given type, to- 
gether with relative spatial and velocity distributions of galax- 
ies and dark matter within halos. The HOD is a fundamental 
prediction of galaxy formation theory (e.g. Kauffmann, Nusser 
& Steinmetz 1997, Kauffmann et al. 1999; White, Hernquist 
& Springel 2001; Yoshikawa et al. 2001; Berlind et al. 2003; 
Kravtsov et al. 2004; Zheng et al. 2005) and it can be extremely 
useful to compare the observational results with the theoretical 
models. 

This paper is organized as follows. In section 2 we describe 
our dataset. In section 3 we describe the methods we use to 
calculate several cluster properties, like the characteristic ra- 
dius, the virial mass, the optical luminosity, and the number 
density profile of cluster galaxies. In section 4 we analyze the 
Lop - M200 and the Ngai - M200 relations, and find that the num- 
ber of galaxies per given halo mass decreases as the halo mass 
increases. In section 5 we seek for a physical explanation of 
this trend, also by comparing our results with theoretical pre- 
dictions. Finally, in section 6 we provide our conclusions. 

Throughout this paper, we use Hq = 70 km s"' Mpc"' in a 
flat cosmology with Q.o - 0.3 and Q.^ = 0.7 (e.g. Tegmark et 
al. 2004). 



2. The data 

The optical data used in this paper are taken from the Sloan 
Digital Sky Survey (SDSS, Fukugita 1996, Gunn et al. 1998, 
Lupton et al. 1999, York et al. 2000, Hogg et al. 2001, 
Eisenstein et al. 2001, Smith et al. 2002, Strauss et al. 2002, 
Stoughton et al. 2002, Blanton et al. 2003 and Abazajian et al. 
2003). The SDSS consists of an imaging survey of n steradians 
of the northern sky in the five passbands u, g, r, i, z, in the entire 
optical range. The imaging survey is taken in drift-scan mode. 
The imaging data are processed with a photometric pipeline 
(PHOTO, Lupton et al. 2001) specially written for the SDSS 
data. For each cluster we defined a photometric galaxy cata- 
log as described in Section 3 of Popesso et al. (2004, see also 
Yasuda et al. 2001). For the analysis in this paper we use only 
SDSS Model magnitudes. The discussion about completeness 
limits in magnitude and surface brightness of the SDSS galaxy 
photometric sample can be found in Popesso et al. (2005a, 
2005b, papers II and IV of this series). 

The spectroscopic component of the survey is carried out 
using two fiber-fed double spectrographs, covering the wave- 
length range 3800-9200 A, over 4098 pixels. They have a reso- 
lution AA/A varying between 1850 and 2200, and together they 
are fed by 640 fibers, each with an entrance diameter of 3 arc- 
sec. The fibers are manually plugged into plates inserted into 
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Fig. 1. Redshift distribution of the cluster sample used in this 
paper. 



the focal plane; the mapping of fibers to plates is carried out by 
a tiling algorithm (Blanton et al. 2003) that optimizes observ- 
ing efficiency in the presence of large-scale structure. 



2.1. The cluster sample 

In this paper we use a combined sample of X-ray selected 
galaxy clusters and optically selected systems. The X-ray se- 
lected clusters are taken from the RASS-SDSS galaxy cluster 
catalog of Popesso et al. (2005b, hereafter paper III), which 
comprises 130 systems selected mainly from the REFLEX and 
NORAS X-ray cluster catalogs. The optically selected clusters 
are taken from Popesso et al. (2005c), who selected a sub- 
sample of 130 Abell clusters with confirmed 3d galaxy over- 
density in the third release of the SDSS galaxy spectroscopic 
catalog. The two samples overlap with 43 clusters. The com- 
bined sample with the exclusion of the double detections com- 
prises 217 clusters and covers the entire range of masses and X- 
ray/optical luminosities, from very low-mass and X-ray/optical 
faint groups (lO'^MO) to very massive and X-ray /optical bright 
clusters (5 x lO'^MO). The cluster sample comprises only 
nearby systems at the mean redshift of 0.1. The redshift dis- 
tribution of the cluster sample is shown in Fig.^ 



3. The cluster properties 

In this section we explain the methods used to calculate the 
cluster properties as the characteristic radius, the virial mass, 
the optical luminosity and the parameters of the radial profile 
of the cluster galaxies. 
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3.1. Characteristic radii and masses 

We here describe the methods by which we measure the char- 
acteristic cluster radii r2oo and mass M2oo- '"200 and M200 are 
the radius and the mass, respectively, where the mass density 
of the system is 200 times the critical density of the Universe 
and it is considered as a robust measure of the virial radius of 
the cluster 

Estimates of a cluster velocity dispersion, mass, and char- 
acteristic radius requires knowledge of the redshifts of its mem- 
ber galaxies. We have used the redshifts provided in the SDSS 
spectroscopic catalog. 

Cluster members are selected following the method of 
Adami et al. (1998a) or Girardi et al. (1993), depending on 
whether the mean cluster redshift Zdusier is known in advance 
(from previous studies) or not, respectively. Girardi et al. 
(1993) method requires in fact that a preliminary cut be done in 
the line-of-sight velocity space, ±4000 km s"' around czduster, 
before searching for significant weighted-gaps in the velocity 
distribution. On the other hand, the density-gap technique of 
Adami et al. does not require such a preliminary cut to be op- 
erated. If Zduster is known already, we select among the groups 
identified by the gapping technique that one closest in veloc- 
ity space to czciuster, otherwise we select the most populated 
one. After the initial group selection, we apply the interloper- 
removal method of Katgert et al. (2004; see Appendix A in that 
paper for more details) on the remaining galaxies, using the X- 
ray center when available, or else the position of the brightest 
cluster galaxy on the cluster colour-magnitude sequence. 

The virial analysis (see, e.g., Girardi et al. 1998) is then per- 
formed on the clusters with at least 10 member galaxies. The 
velocity dispersion is computed on the cluster members, using 
the biweight estimator (Beers et al. 1990). The virial masses 
are corrected for the surface pressure term (The & White 1986) 
by adopting a profile of Navarro et al. (1996, 1997; NFW here- 
after) with a concentration parameter, c, that depends on the 
initial estimate of the cluster virial mass itself. The c-mass re- 
lation is given by c = 4 x (M/Mj^^m) " '"^ where the slope of 
the relation is taken from Dolag et al. (2004), and the normal- 
ization Mkbm - 2 X lO'^Mo from Katgert et al. (2004). The 
clusters in our sample span a range c ^ 3-6. 

Correction for the surface pressure term requires knowl- 
edge of the r2oo radius, for which we adopt Carlberg et al. 
(1997) definition (see eq.(8) in that paper) as a first guess. After 
the virial mass is corrected for the surface pressure term, we re- 
fine our r20() estimate using the virial mass density itself. Say 
Myir the virial mass (corrected for the surface term) contained 
in a volume of radius equal to a chosen observational aperture, 
rap. The radius r2()o is then given by: 

r20O=rap\Prir/{200pc)]''^-^ (1) 

where py,> = 3Mi,,>/(47rr;^p) and pdz) is the critical density at 
redshift z in the adopted cosmology. The exponent in eq.Q is 
the one that describes the average cluster mass density profile 
near r2oo, as estimated by Katgert et al. (2004) for an ensemble 
of 59 rich clusters. 

A NFW profile is used to interpolate (or, in a few cases, 
extrapolate) the virial mass M^ir from rap to r2oo, yielding M2oo- 
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Fig. 2. Relation between the optical luminosity calculated in 
the SDSS r band within r2oo and the number of cluster galaxies 
contributing to Lgp. The solid line is the best fit line with slope 
1.00 + 0.03. 



As before, we scale the concentration parameter of the used 
NFW profile according to a preliminary estimate of the mass 
of the system. From M200 the final estimate of r2oo is obtained, 
using the definition of M200 itself. 

3.2. Optical luminosities 

The total optical luminosity of a cluster has to be computed af- 
ter the subtraction of the foreground and background galaxy 
contamination. We consider two different approaches to the 
statistical subtraction of the galaxy background. We compute 
the local background number counts in an annulus around the 
cluster and a global background number counts from the mean 
of the magnitude number counts determined in five different 
SDSS sky regions, randomly chosen, each with an area of 30 
deg^. In our analysis we show the results obtained using the op- 
tical luminosity estimated with the second method. The optical 
luminosity is then computed within r2oo following the prescrip- 
tion of Popesso et al. (2004). The reader is referred to that paper 
for a detailed discussion about the comparison between optical 
luminosities calculated with different methods. To avoid selec- 
tion effects due to the slightly different redshifts of the clusters, 
the optical luminosity has been calculated in the same abso- 
lute magnitude range for all the clusters. The adopted range 
has been varied to check the robustness of the results of the 
regression analyses. 
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3.3. Number density profiles and projection effects 

The observed cluster optical luminosity, Lgp, is contributed not 
only by galaxies within the virial sphere of radius r2oo, but also 
by galaxies outside the virial sphere yet within the cylinder of 
same radius. It is therefore necessary to correct the observed 
optical luminosity for the contribution of cluster galaxies out- 
side the virial sphere (the field galaxies contribution is removed 
statistically as described in the previous section). 

Fig. 12 shows the proportionality between the cluster r- 
band optical luminosity within r2oo and the number of clus- 
ter galaxies (Ngai), contributing to the luminosity itself, i.e. the 
background-subtracted galaxy counts within the same radius, 
down to the magnitude limit used to estimate L„p. Because of 
the strict proportionality between L^p and Ngai, we can use 
the ratio between the number of cluster galaxies within the 
cylindrical volume and the number of galaxies within the virial 
sphere, to correct the observed Lgp for the contribution of clus- 
ter galaxies outside the virial sphere. In order to estimate this 
ratio, we build the surface number density profiles of our clus- 
ters, and fit them with two widely-used analytical functions, the 
King (1962) cored profile, and the NFW cuspy profile. The 3D 
and projected King profiles are given by, respectively: 



and: 



no 



(1 + (r/r,)2)3/2 



(1 + (fo/r,)2) 



(2) 



(3) 



where is the core radius and ctq - In^rc is the normalization 
(see also Sarazin 1980). The NFW profile in 3D is given by: 



p{r) 



So 



r/r,(l+(r/r,)2) 



(4) 



where is the characteristic radius (r, - r2oo/c with c the 
concentration parameter) and <5o is the normalization. The pro- 
jected surface density profile is then obtained from an inte- 
gration of the three-dimensional profile (see Bartelmann et al. 
1996). 

As a first step we explore the the mean surface density 
galaxy distribution within our cluster sample, by stacking the 
projected galaxy distributions of the individual systems. Note 
that in this analysis we only consider the clusters with available 
X-ray centers, in order to reduce possible mis-centering when 
adopting the positions of brightest cluster galaxies as cluster 
centers (not all brightest cluster galaxies lie at centers of their 
parent clusters, see, e.g., Lin & Mohr 2004). The clustercentric 
distances are rescaled to the cluster raoo before the stacking. 
The cluster galaxy distributions are normalized to the total 
number of galaxies within r2oo, after subtraction of the mean 
background galaxy density, evaluated within the 2.5 -3.5 xr2oo 
annulus. Fig.|3]shows the stacked surface density profile of all 
the 217 clusters. The best fit is given by a King profile with core 
radius rc/r2oo - 0.224 + 0.005, while a NFW profile provides 
a poor fit near the centre. We then split our sample of clusters 
in 6 mass bins: M2oo/10"^Ma < 1, 1 < M2oo/1O''^M0 < 3, 
3 < M2OO/IO14M0 < 7, 7 < M2oo/10''*Mo < 10,10 < 
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Fig. 3. The stacked mean surface number density profile of all 
the cluster galaxies with magnitude r < -18.5. The solid curve 
is the best fit King profile, the dashed curve is the best fit NFW 
profile. 

M2oo/10"*Mo < 30, and M2oo/1O''*M0 > 30. Each bin con- 
tains at least 10 clusters. 

Fig. 0] shows the surface density profiles in each cluster 
mass bin. The solid line in each plot shows the King profile 
that provides the best-fit to the surface density profile of all 
galaxies in all clusters, already shown in Fig.|3 The dashed 
line in each panel is the best-fit King profile for the surface 
density profile of each cluster mass bin. The NFW profiles pro- 
vide poor fits for most cluster mass bins, and are not plotted. 
From Fig. 0]one can clearly see how the cluster galaxy distri- 
bution changes with cluster mass. The surface density profiles 
become steeper near the centre as the cluster mass increases. 
Note in fact that the surface number density profile in the low 
mass bin (M2oo/1O'^M0 < 1) is not completely consistent with 
a King profile since it shows a deficit of galaxies near the cen- 
ter. The core radius is quite large, rc/r2Qo = 0.40 + 0.08. The 
core radius becomes smaller as the cluster mass increases, and 
it is rdrzoo - 0.16 + 0.01 for clusters in the mass interval 
7 < M2oo/10"*Mo < 10. In the last two mass bins, the galaxy 
distributions become so concentrated that the simple King pro- 
file does no longer provide a good fit, and a generahzed King 
profile is needed, of the form: 



(Tib) 



0"o 



(5) 



{\+{blr,W 

The dotted lines in the panels of Fig. 0] corresponding to the 
highest cluster mass bins, show the best fit given by the gen- 
eralized King profile, where y6 - 0.91 + 0.01. Finally, in the 
highest mass bin (M200 > 3 x lO'*") also the projected NFW 
profile provides a good fit to the galaxy distributions. This is 
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Fig. 4. The stacked surface number density profiles of clusters in different cluster mass bins. The individual cluster profiles are 
obtained by considering all the galaxies with rp^tro < -18.5. The dashed curve in each panel is the King profile that provides the 
best-fit to the surface density profile shown in that same panel, while the solid curve is the King profile that provides the best-fit 
to the mean stacked surface density profile (same as the sohd hne in Fig.|3}. The dotted line in the bottom panels (corresponding 
to the highest-mass bin clusters) is the best fit provided by a generalized King profile. 



shown in Fig.|5l where the best-fit generalized King profile is 
shown as a solid curve, and the best-fit NFW profile is shown as 
a dashed curve. In this case, the best-fit value of the NFW con- 
centration parameter is c = 4.2 + 0.3, and is consistent with the 
value found for the dark matter distribution in similarly mas- 
sive clusters (e.g. Biviano & Girardi 2003; Katgert et al. 2004). 

Lin et al. (2004) perform the same analysis on a smaller 
sample of 93 X-ray selected clusters observed in the 2MASS all 
sky survey. Their conclusion is that the surface density profile 
of cluster galaxies is consistent with a NFW profile with con- 
centration parameter 2.90 + 0.22. They study the galaxy distri- 
bution in 2 mass bins with mean mass < M500 >= 7.9x IO'-'MO 
for the groups, and < M500 >= 5.3 x 10'"* M0, for the massive 
clusters, and claim that the spatial profiles are consistent with 
the mean profile in both mass bins. However, by fitting their 
data (taken from Fig. 8 of Lin et al. 2004) with both a King, 



a generalized King, and a NFW profile, we find that a King 
profile provides the best fit, in agreement with our findings. 

Our results are further supported by the analysis of the sur- 
face brightness profile of our clusters. Fig. |6l shows the com- 
posite surface brightness profile of two cluster subsamples: the 
low-mass systems with M200 < 10^* Mq and the high-mass 
clusters with M20() > 3 x IO'^Mq. The profile of the low-mass 
objects displays a core, and is less centrally concentrated than 
that of the high-mass clusters which is in fact rather cuspy. As 
expected, due to the presence of the Brightest Cluster Galaxies 
at the center of the systems, the luminosity profiles are gener- 
ally cuspier than the density profiles in the same cluster mass 
bins (Adami et al. 2001). 

In stacking clusters, we have assumed circularity, because 
the number of galaxies per cluster is generally too small to al- 
low a precise determination of individual cluster shapes and 
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Fig. 5. The surface number density profile of all the cluster 
galaxies with r < -18.5 in the highest of our considered clus- 
ter mass bins. The solid curve is the best fit provided by the 
generalized King profile. The dashed curve is the best fit pro- 
vided by the NFW profile. Both profiles are consistent with the 
data. 




Fig. 6. Composite surface brightness profile of two cluster 
samples: the low mass systems at M200 ^ IO'^^Mq (empty 
squares) and the massive clusters at M200 > 3 x lO'^M© (filled 
dots). 



orientations. Adami et al. (1998b) have shown that enforcing 
circularity could create a central artificial cusp in the number 
density profile of the stacked cluster However, lower mass 
clusters are more elongated than higher mass clusters (see 
Fasano et al. 1993; de Theije et al. 1995; PHonis et al. 2004), 
so the eff'ect of assuming circularity should lead to cuspier den- 
sity profiles for lower mass clusters, which is opposite to what 
we find. Indeed, the effect reported by Adami et al. does not 
seem to be strong enough to account for the differences seen in 
the density profiles of the stacked clusters of different masses 
(compare Fig.l^with Fig. 7 in Adami et al. 1998b). 

Hence we conclude that there is a significant variation of 
the number density and luminosity density profiles of clusters, 
as a function of cluster mass, with higher mass clusters dis- 
playing more concentrated profiles. As a consequence, also the 
correction needed to convert the number of galaxies contained 
in the cylindrical volume to that in the virial sphere depends 
on the cluster mass. Using the volume and the surface density 
King profile given in eq.|2land|3l respectively, we estimate that 
the ratio between the number of galaxies in the virial sphere of 
radius r20() and the number of galaxies actually observed in the 
cylinder of same radius is 0.69-0.76 for clusters in the lowest- 
mass bin, 0.78-0.80 for clusters in the 1 - 3 x 10''*Mo mass bin, 
0.81 in the 3 - 7 X IO'^^Mq mass bin, and 0.85 in the highest 
mass bins. 

We performed the same analysis separately for the red 
(early-type) and blue (late-type) cluster galaxy populations. 
The colour separation between the two population is based on 
the SDSS galaxy color u-r (Strateva et al. 2001, Popesso et al 
2006). For both the red and the blue galaxy populations, the 
core radius of the best-fit King profile monotonically decreases 
from the low-mass systems to the more massive clusters (see 
Figure0and|SJ. 

Since there is a significant mass-dependence of the num- 
ber density profiles, a mass-dependent deprojection correction 
needs to be applied to the observed values of Lop. In the fol- 
lowing, we only consider the deprojection-corrected values of 
Lop, obtained by adopting the correction factors per mass bin 
derived above. 

4. The Lop - M200 and the Ngai - M200 relations 

In Fig.|5]we show the r-band Lop -M200 relation after correcting 
Lop for the projection effects (see section l33t . The solid line in 
the figure is the best-fit linear regression in logarithmic space, 
and the dotted line is the best-fit we would have obtained had 
Lop not been corrected for the projection effects. The best-fit 
relation for the corrected Lop is: 

LopKlO'^L^) = 10-°-2''-°»(M2„o/(10'^Mo))°-^2^°°l (6) 

The slope of this relation is steeper than the slope of the uncor- 
rected relation, which is 0.85 ±0.03. The two values are anyhow 
marginally consistent within 2.5cr. As a consequence, also the 
M/L - M relation is flatter The slope of the corrected relation 
is 0. 1 8 + 0.04 instead of 0.27 + 0.04 for the uncorrected relation. 
Remarkably, we find that the slopes of the best-fit L - M and 
M/L - M relations do not depend on the chosen photometric 
SDSS band. 
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Fig. 7. The stacked surface number density profile of the red cluster galaxies with magnitude r < 
of different masses. The meaning of the lines is the same as in Fig.|3 



-18.5, separately for clusters 



Due to the strict proportionality observed between the clus- 
ter optical luminosity and the number of cluster galaxies (see 
Fig. 13, it is clear that the Lop - M200 relation is strictly 
connected to the Ngai - M200 relation. In other words, the 
cluster mass-to-light ratio M/L is strictly related to the Halo 
Occupation Number y of the Halo Occupation Distribution 
(HOD) oc Af. It is then useful to study the cluster M/L in 
terms of the HOD since this allows an easier comparison with 
the predictions of models of structure formation. 

We study the HOD with two approaches. First we use the 
Ngai calculated using the photometric data (Npi,ot), by summing 
the background-subtracted cluster number counts used to cal- 
culate Lop. As a second approach we estimate the number of 
spectroscopically-confirmed cluster members (Nspec)- Both es- 
timates are corrected for projection effects in the same way as 
we did for Lgp. Both Nphot and Nspec are computed down to the 
same absolute magnitude, in order to allow comparison of the 
two estimates. The SDSS spectroscopic and photometric cata- 
logs have two different apparent magnitude limits (r - 17.77 



for the spectroscopic catalog and r ~ 21 mag for the photo- 
metric one). We apply an absolute magnitude cut of M, < -20, 
which allows us to sample the cluster luminosity function (LF 
hereafter) down to M* + 2 (Popesso et al. 2005a). With such a 
cut, Nspec can be measured for a significant fraction of our clus- 
ter sample, those 90 clusters for which < -20 is brighter 
than the apparent magnitude limit of r = 17.77. 

In Fig. ^1 we show the Ngai - A/200 relation, using Ngai = 
Nphoi ■ We also plot the best fit relations 



Ngai = 10 
for Ngai = N, 

Ngai = 10 



-11.60±0.59 



gal — phot 

-I1.43±0.76 



(M200/M0)' 

and 



0.91 ±0.04 



(M200/M0)' 



0.89±0.05 



(7) 



(8) 



for Ngai = Nspec- The two estimates of the halo occupation 
number y are consistent, while the different normalizations 
reflect the incompleteness of the spectroscopic samples (see 
Popesso et al. 2006). The orthogonal scatter in both relations 
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Fig. 8. The stacked surface number density profile of the blue cluster galaxies with magnitude r < -18.5, separately for clusters 
of different masses. The meaning of the lines is the same as in Fig.|3 



is 35%, and M200 can be predicted from Ngai with an accuracy 
of 55%. 

Had we not corrected Ngai for the projection effects, we 
would have underestimated the slope for the Ngai - ^200 rela- 
tion, obtaining 0.79 + 0.04. Clearly, applying an average, mass 
independent, correction to the observed value of Ngai and Lgp 
leads to underestimate the slope of the considered relations. 

We check also if different cluster selection techniques in- 
troduce biases in our analysis. For this purpose we perform the 
same analysis separately on the optically and X-ray selected 
cluster samples, respectively. The observed best fit values are 
consistent within the statistical errors. Moreover, we perform 
the analysis by adopting different magnitude cuts to check for 
vatiation of the Halo Occupation number in different magni- 
tude regimes. We consider the following magnitude cuts: -20, 
-17 and -16 mag in the i band. While the normalization of the 
relation is obviously changing, the best fit values of the Halo 
Occupation number are cosistent within the errors in all the 
magniture ranges. 



The Halo Occupation Number y has been measured with 
several different methods in the literature. Most of these come 
from assuming a form of the HOD, and adjusting the parame- 
ters until the prediction from the halo model matches the ob- 
served galaxy clustering (e.g. Seljak et al. 2004, Peacock & 
Smith 2000; Yang et al. 2003; Zehavi et al. 2004; Magliocchetti 
& Porciani 2003). Pisani et al.(2003) used the velocity dis- 
persion in the groups of the Zwicky catalog and obtained 
r = 0.70 + 0.04, while Marinoni & Hudson (2002) used 
the LP of the Nearby Optical Galaxy sample and obtained 
r = 0.55 + 0.043. 

Many other works in the literature used an approach similar 
to ours. Kochanek et al. (2003) used a sample of clusters identi- 
fied in the 2MASS all sky survey, and obtained y - 1.11+ 0.09 
on a sample of 84 clusters. Lin et al. (2004) used a sample of 93 
X-ray clusters observed in 2MASS, and found y - 0.84 + 0.04. 
Similar results were obtained by Yang et al. (2005) who used a 
large sample of groups identified in the 2-degree Field Galaxy 
Redshift Survey. 
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Fig. 9. Lop - M200 relation. The optical luminosity is calculated 
within ^200 and is corrected for contamination due to projection 
effect. The empty squares in the plot are the X-ray selected 
clusters. The filled points are the optically selected clusters. 
The soUd Une in the plot is the best fit line of the corrected 
L„p — M200 relation. The dashed fine is the best fit line of the 
uncorrected Lgp - M200 relation. 

With the exclusion of Kochanek et al. (2003), all other stud- 
ies agree on the fact that the exponent in the N-M relation, and 
consequently in the L- M relation, is not consistent with unity 
(see Lin et al. 2004 for a discussion about the discrepancy with 
the results of Kochanek et al. 2003). However, with the mass- 
dependent correction applied to our clusters to clean the N-M 
(L - M) relation from projection efi'ects, the estimated value of 
y becomes closer to unity. Nevertheless, a direct proportional- 
ity between cluster mass and number of cluster galaxies is still 
excluded by our analysis at the ~ 2-2.5cr level. 

5. Luminosity function sliape and cluster mass 

In this section we investigate whether the lack of galaxies ob- 
served in the high-mass systems is related to a different shape 
of the LFs of clusters of different masses. The universality of 
the cluster LF has been analysed in two papers of the RASS- 
SDSS Galaxy Cluster Survey Series (Popesso et al. 2005a, 
Popesso et al. 2006). When measured within the cluster virial 
radius (r2oo), the shape of the LF does not change from cluster 
to cluster both at the faint and at the bright end (Popesso et al. 
2006). Moreover, we observed that the cluster to cluster vari- 
ations of the LF found in the Uterature are due to choice of a 
fixed metric apertures for all the systems. This is due to the fact 
that fraction of dwarf galaxies in clusters is an increasing func- 
tion of the clustercentric distance (see also Durret et al. 2002). 




0.01 0.1 1 10 



M3oy(10'*Mj 

Fig. 10. Ngai - M200 relation. The number of cluster galaxies 
is calculated within ^200 and is corrected for contamination due 
to projection effect. The empty squares in the plot are the X- 
ray selected clusters. The filled points are the optically selected 
clusters. The solid line in the plot is the best fit Une of the cor- 
rected Ngai - M20() relation. The dashed line is the best fit Une 
of the uncorrected Ngai - ^200 relation. 




Fig. 11. Relation between the i band petrosian absolute magni- 
tude and the galaxy stellar mass. The galaxy stellar masses are 
taken by Kauffinann et al. (2003). 
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Fig. 12. The composite LF for the whole (left panel) and the red (right panel) cluster galaxy populations. In both panels the filled 
points are the low-mass clusters (M2oo/IQ^'^Mq < 1), the empty squares the intermediate-mass systems (1 < M2()o/10''*Mq < 10) 
and the empty triangles are the high-mass clusters (Maoo/lO'^^Mo > 10). The different mass-bin LFs are renormalized so as to 
ease the comparisons among them. The solid lines in the panels are the best fits obtained in Popesso et al.(2006) from the X-ray 
selected RASS-SDSS galaxy clusters for the corresponding whole and red cluster galaxy populations. 



To keep under control also the possible dependence between 
the shape of the LF and the cluster mass, we divided our clus- 
ter sample (with mass ranges from lO'^M© to 4 x lO'^M©) in 
three mass bins (M2oo/1O''*M0 < 1, 1 < M2oo/10"*Mo < 10 
and Mioo/lO'^^Mo > 10). To increase the statistics and study 
the average luminosity distribution of the galaxies in each mass 
bin, we have used the SDSS photometric data to compute a 
composite luminosity function (LF) by stacking the individual 
cluster LFs calculated within r2oo- The individual LFs are ob- 
tained by subtracting the field number counts calculated within 
an annulus around the cluster (0.2 deg with), from the num- 
ber counts in the cluster region, as described in Popesso et al. 
(2005a). Following previous works, we exclude from the in- 
dividual cluster LFs the Brightest Cluster Galaxies (BCGs). 
The composite LF in each mass bin is calculated by follow- 
ing the prescriptions of Colless (1989, see also Popesso et al. 
2005a for more details about this method). We require at least 
10 clusters contributing to each magnitude bin of the com- 
posite LF. This requirement is fulfilled at magnitudes brighter 
than the absolute magnitude limit / - 5 log{h) < -16.7 mag in 
all the cluster mass bins considered, while 95% of our clus- 
ters have magnitude limits brighter than -18.25 mag in the 
i band. Thus, we consider galaxies down to 5.5 mag fainter 
than the cluster M* in this SDSS band (Popesso et al. 2006). 
Moreover, we use the stellar masses estimated by Kauffmann 
et al. (2003) for the DR2 spectroscopic sample to evaluate the 
stellar mass range sampled within this magnitude limit. As 
shown in Fig. ^2 although the scatter is large (0.18 dex), the 
magnitude cut at -16.7 mag corresponds roughly to a galaxy 



stellar mass of 1.5 x 10^ Mq. As in section l33l we distinguish 
between early and late type galaxies using a SDSS color cut 
at u - r - 2.22. In Fig. ^] we show the composite LF for the 
whole (left panel) and the red (right panel) cluster galaxies pop- 
ulations. In both panel the filled points are the low-mass clus- 
ters (M2oo/1O''*M0 < 1), the empty squares the intermediate- 
mass systems (1 < M2qo/10^*Mq < 10) and the empty tri- 
angles are the high-mass clusters (M2oo/10'^Mo > 10). The 
different mass-bin LFs are renormalized so as to ease the com- 
parisons among them. The solid lines in the panels are the best 
fits obtained in Popesso et al. (2006) from the X-ray selected 
RASS-SDSS galaxy clusters for the corresponding whole and 
red cluster galaxy populations. From Figure[21it is clear that, 
at magnitudes brither than -16.7 mag (alternatively, for galaxy 
stellar masses above 1.5 x 10*^ Mq), there are no significant dif- 
ferences among the LFs in the different mass bins. Moreover, 
the best fit of the composite LF of the X-ray selected RASS- 
SDSS sample provides a very good fit to any of the considered 
LFs. We conclude that the cluster LF does not depend on the 
cluster mass. This conclusion is consistent with our previous 
findings (Popesso et al. 2006). 

The previous analysis is based on LFs with the BCGs ex- 
cluded. Here we examine to what extent can the BCG lumi- 
nosities be considered the high-end tail extension of the overall 
cluster LFs. This has been shown not to be the case by pre- 
vious investigations (e.g. Schechter 1976, Bhavsar & Barrow 
1985). The Schechter function was generally found to provide 
a good fit to the observed galaxy magnitude distribution as long 
as the very brightest galaxies, the cD galaxies, were excluded 
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Fig. 13. Upper panel: The magnitude of the BCGs (empty 
squares) within raoo as a function of the number of cluster 
galaxies within the same radius, and with r < -18.5 mag. 
The solid line in the plot shows the expected BCG magnitude 
as estimated from the best-fitting composite LF, as a function 
of the cluster normalization. The dashed lines are the statisti- 
cal uncertainties in the relation represented by the solid line. 
The filled squares are the median magnitudes per bin of Ngai- 
Middle panel: same as the left panel, but for the second bright- 
est cluster galaxies. Bottom panel: same as the left panel, but 
for the third brightest cluster galaxies. 



from the fit (Schechter 1976). The exceptional luminosities of 
these galaxies have often been interpreted as arising from spe- 
cial processes that are not common to all galaxies, and are 
particularly effective at the bottom of cluster potential wells. 
Nevertheless, Lugger (1986) did not find significantly differ- 
ent best-fits to the observed cluster galaxy LFs using Schechter 
functions, when BCGs were or were not included in the sam- 
ple. 

Following Colless (1989), we normalize the cluster LFs to 
the number of cluster galaxies in a common magnitude region 
(r < -18.5 mag in the present case, see Popesso et al. 2005afor 
details). Given the number of cluster galaxies in that magnitude 
region and the best-fit Schechter function of the composite LF, 
it is possible to calculate the magnitude of the «''' brightest 
cluster member as the magnitude corresponding to N{Mr) = n, 
where N(Mr) is the analytical form of the cluster integral LF. 
For this we use the best-fit obtained with a composite of two 
Schechter functions, after excluding the BCGs. 

The left panel of Fig.^jshows the magnitude of the bright- 
est spectroscopically-confirmed cluster members within r2oo, 
as a function of the number of cluster galaxies within r2oo and 
with r < -18.5 mag. The solid line shows the expected magni- 
tude of the brightest galaxies, as estimated from the best-fit LF, 
vs. the cluster normalization. The dashed line are the statistical 
uncertainties in the location of the brightest cluster member. 
Clearly, the estimated magnitudes of the 1 " ranked galaxies are 
consistent with the observed values, as can be judged by con- 
sidering the median of the P'-ranked galaxy magnitudes per 
Ngai bin (filled squares in the plot), and by the fact that 95% of 
the BCGs lie within the statistical uncertainties of the expected 
relation. The middle and the right panel of Fig.[0]are similar to 
the left panel, but for the 2"'' and 3"' brightest cluster galaxies 
respectively. Again, the agreement between the expected and 
observed magnitudes is extremely good, and the similarity of 
these three plots argues against the BCG magnitudes being an 
anomaly of the cluster LF. 

The reason why our result disagrees with previous findings 
(Postman & Lauer 1995) must be related to the use of a double 
(instead of a single) Schechter function for the fit of the ob- 
served LF, which allows a better representation of the LF bright 
end. This was first pointed out by Biviano et al. (1995) in their 
study of the Coma cluster LF (see also Thompson & Gregory 
1993). The deviation of the cluster LF from a single Schecter 
function was also found in the clusters extracted from the N- 
body simulations combined with semi-analytical models anal- 
ysed by Diaferio et al. (1999). They interpreted the LF shape 
as the effect of the large merger cross-section of the bright and 
massive central galaxies. 

Our result is in agreement with the recent findings of Lin 
& Mohr (2004) and Yang et al. (2005) of a tight correlation 
between the BCG luminosity and the cluster mass. In partic- 
ular, in the mass range 10'^ < M2oq/Mq < 10'^ it results 
Lbcg °^ excellent agreement between Lin & Mohr's 

result and ours is demonstrated in Fig.^^ There we show the 
relation between the BCG luminosity and the cluster mass of 
our cluster sample, where we transformed the cluster Ngai into 
cluster masses using the HOD we derived in section |3 The 
solid line in the plot is the best-fit obtained with an orthogo- 
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Fig. 14. The relation between BCG luminosity and cluster 
mass for our cluster sample. Cluster masses are computed from 
Ngais using our derived HOD. The solid line in the plot is 
the best-fit line obtained with an orthogonal linear regression, 
Lbcc 0^ M^'o^j'=^" "\ and the dashed hne is Lin & Mohr's (2004) 
relation. Other symbols have the same meaning as in the Fig. 

m 

nal linear regression, Lbcg ^20?'''"''*' ™d it is in excellent 
agreement with the Lin & Mohr (2004) relation (the dashed 
line in the plot). 

6. The fundamental plane of cluster ellipticals 

The elUptical galaxies are the dominant population in clus- 
ters and therefore any variation of their mass-to-light ratio as a 
function of the cluster mass could contribute affecting the slope 
of the N - M and the L- M relations. I.e. one could still have a 
constant ratio between the total cluster mass and the total mass 
in galaxies, even for y < 1 (see section|3, if galaxies of given 
luminosity have higher masses in higher-mass clusters. 

To investigate whether elliptical galaxies in high mass clus- 
ters have a higher average M/L than their counterparts in low 
mass systems, we determine the fundamental plane (FP here- 
after) of ellipticals as traced by the spectroscopic members 
of each cluster within r2oo. The FP relates the effective ra- 
dius of the luminosity distribution of ellipticals, r^, with their 
internal velocity dispersion, cr, and their surface brightness 
(Djorgovsky & Davis 1987). If the virial radius of ellipticals is 
linearly proportional to and their internal velocity dispersion 
to the virial value, the FP effectively can be used to constrain 
the mass-to-light ratio of elliptical galaxies. 

For this analysis, as before, we have divided our cluster 
sample in three subsamples of low-, intermediate-, and high- 
mass. Ellipticals are identified within each cluster using the se- 



lection criteria of Bernardi et al. (2003a). As a measure of the 
effective radius we use the Petrosian radius rso, which encloses 
50% of the total Petrosian luminosity, multiplied by the square 
root of the ratio b/a of the lengths of the minor and major axes 
of the observed surface brightness profile. The SDSS spectro- 
scopic catalog contains a measure of the line of sight velocity 
dispersion which has been corrected for aperture effects as in 
Bernardi et al. (2003a). In what follows, we show the best cor- 
relation between the variables r^, cr and fi = -2.5 log[(L/2)/r^] 
in the SDSS r-band. Data are fitted with the ODRPACK routine 
(Akritas & Bershady 1996). 

The left-hand side panel of the fig.ll5lshows the best-fit FP 
relating the three variables cr, r,, and fi; our result is consistent 
with Bernardi et al. (2003b). The different symbols in the plot 
indicate elliptical galaxies of clusters in different mass bins. We 
do not find any variation of the best-fit FP for the different mass 
samples. In the right panel of the same figure we show the mean 
residuals from the FP of the elliptical populations of individual 
clusters as a function of the cluster masses. The scatter around 
the FP is ~ 10% as in Bernardi et al. (2003b) and the mean 
residual is consistent with zero independent of the cluster mass. 

Any variation of the average mass-to-light ratio of the ellip- 
ticals would result in a variation of their FP. The constancy of 
the FP across the cluster mass range therfore implies a constant 
mass-to-light ratio of the cluster ellipticals. 

7. Discussion 

Using a large sample of galaxy clusters we have shown that the 
number of galaxies per unit mass is lower in clusters of higher 
masses, i.e. the slope of the relation oc M'*' is 7 < 1 at the 2.5 
cr level. Our result is thus in agreement with previous findings 
(see, e.g., Lin et al. 2004 and references therein) although we 
find a somewhat steeper N - M relation (y closer to unity), be- 
cause of our mass-dependent correction for projection effects. 

^From the theoretical point of view, y < 1 is expected. On 
one hand, hierarchical models of structure formation predict a 
universal mass distributions of the subhalos (see, e.g., De Lucia 
et al. 2004, and Gao et al. 2004), independent from the mass of 
the parent halo. As a consequence, the number of subhalos is 
directly proportional to the parent halo mass (A^ oc M). On the 
other hand, including baryons in the simulations leads to a de- 
creasing number of galaxies per given mass in halos of higher 
masses (i.e. y < 1, see, e.g., Benson et al. 2000a, 200b; White 
et al. 2001; Berlind et al. 2003), and of an increasing MjL as a 
function of mass (e.g. Kauffmann et al. 1999). This could hap- 
pen as the consequence of a decreasing efficiency of gas cool- 
ing and star formation (see, e.g., Springel & Hernquist 2003; 
Berlind et al. 2003; Borgani et al. 2004; Kravtsov et al. 2004), 
or because of an increased merger rate (White et al. 2001), or 
of an increased destruction rate of galaxies (Lin et al. 2003), as 
the mass of the parent halo increases. 

Although we observe y < 1 as predicted, a deeper look at 
other properties of our clusters seems to be at odds with theoret- 
ical predictions. If mergers and/or tidal effects are responsible 
for reducing the number of galaxies per given mass in clusters 
of higher masses, we expect to see these processes to leave an 
imprint into the cluster LFs. Instead, we find that the LF is uni- 
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Fig. 15. Left panel: the best-fit FP of cluster ellipticals, relating their internal velocity dispersions (cr), eff'ective radii (re), and 
surface brightnesses within (ju). The different symbols in the plot indicate elliptical galaxies of clusters in diff'erent mass bins. 
The right-hand side panel shows the mean residuals from the global FP of the elliptical populations of individual clusters as a 
function of the cluster mass. 



versal, with no dependence on the cluster mass. Our result is at 
odds with Lin et al. (2004). The reason for this difference is un- 
clear, but it could be related to the different photometric bands 
(Lin et al. use the K-band), although it is difficult to see why 
the LFs of clusters of different masses should look identical in 
four SDSS photometric bands and different in the K-band. 

Another result argues against galaxy-destruction via tidal 
stripping being more efficient in higher mass clusters. Higher 
mass clusters are characterized by a more concentrated num- 
ber density profile (see section 13. 3> and a less concentrated 
mass density profile (NFW; Katgert et al. 2004; Pratt & Arnaud 
2005) near the centre. If anything, this is consistent with a pic- 
ture where galaxies are more likely to survive near the centre 
of higher mass clusters, while galaxies in lower-mass clusters 
are destroyed when crossing the cluster core, because of the ef- 
ficient tidal stripping resulting from a more concentrated halo 
mass profile. 

A lower efficiency of star formation in galaxies of higher 
mass clusters would also lead to observe y < 1 in the HOD. 
A consequence of this process should be visible in a decreased 
M/L for the galaxies of higher mass clusters, as compared to 
the galaxies of lower mass clusters. We have explored this pos- 
sibility by the analysis of the FP of cluster ellipticals. No ev- 
idence for a variation of the FP as a function of cluster mass 
was found. This result argues for a constant M/L and hence a 
similar star formation efficiency of cluster ellipticals in clus- 
ters of different masses, in agreement with the predictions of 
Diaferio et al. (2001), based on numerical simulations com- 
bined with semi-analytical models of galaxy formation. Note, 
however, that Springel et al. (2001) have argued that even heav- 



ily stripped cluster galaxies obey the Faber- Jackson relation, 
since the internal velocity dispersion of a stripped subhalo re- 
mains relatively stable until it is fully disrupted. Hence the con- 
stancy of the FP does not rule out the possibility of subhaloes 
stripping. 

Bahcall & Comerford (2002) have suggested that the ob- 
served increasing M/L of clusters as a function of cluster 
masses is a consequence of a higher fraction of galaxies with 
evolved stellar populations in higher mass clusters. There is no 
evidence for this in our data (Popesso et al. 2005e). Moreover, 
Bahcall & Comerford's prediction that M/L vs. M would be- 
come flatter when the photometric band is moved to longer 
wavelengths, is also ruled out by our data, where we see that the 
relation does not change by changing SDSS photometric band, 
in agreement with the results of numerical simulations com- 
bined with semi-analytical modelling (Kauffmann et al. 1999). 

How can then we reconcile the observed N-M with the pre- 
dictions for a universal subhalo mass distribution? It is hard to 
find physical processes capable of reducing the number of ob- 
served galaxies per given mass, while at the same time leaving 
the subhalo mass distribution, the galaxies LF, and the average 
galaxy mass-to-light ratios unchanged. Hence, the most likely 
explanation is that the mass distribution of the subhalos is not 
universal and the observed y < 1 for galaxies simply reflect an 
underlying y < 1 for subhalos. 

A final considerations is in order Our correction for projec- 
tion effects does work in the sense of changing the observed y 
of the oc closer to unity. The resulting y is still found to 
be below unity, but the significance of this is not overwhelm- 
ing (2.5 <T level). Hence it is well possible that other insofar 
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unapplied corrections, or, perhaps, an improved correction for 
the projection effects, could make y consistent with unity, thus 
reconciUng theory and observations. 

8. Conclusions 

We have studied the L - M and the N - M relations in the 4 
SDSS bands g, r, i, z for a sample of 217 galaxy clusters with 
confirmed 3D overdensity in the SDSS DR3 spectroscopic cat- 
alog. All the quantities are measured within the characteristic 
cluster radius r2oo- We have remarked upon the direct connec- 
tion between the two relations due to the proportionality of the 
cluster optical luminosity and the number of cluster galaxies. 

We have studied the galaxy surface number density pro- 
file in five bins of cluster mass and discovered that the profile 
has a strong dependence on the cluster mass. In the low and 
intermediate mass systems the best fit is provided by a King 
profile. The core radius of the best fit is decreasing as a func- 
tion of the cluster mass, while the central galaxy density is in- 
creasing. In the highest mass bins a more concentrated general- 
ized King profile or a cuspy NFW profile provide the best fits. 
Using the best fit profile in each mass bin, we have converted 
the observed number of cluster galaxies to the value within the 
virial sphere. Since clusters of different masses exhibit different 
surface density profiles, the deprojection correction decreases 
with the cluster mass. Applying this mass-dependent correc- 
tion affects the L - M and N - M relations, by increasing the 
slope of these relations to the value of 0.92 + 0.03. Similarly, 
also the slope of the M/L - M relation is affected and becomes 
0.18 + 0.04. Hence, neglecting the dependence of the deprojec- 
tion correction on the cluster mass, leads one to underestimate 
the slope of the Lop - M200 and Ngai - M200 relations. Despite 
the deprojection correction, the derived N - M and the L - M 
relations are still only marginally consistent with unity, at the 
2.5cr level, i.e. direct proportionality between cluster mass and 
number of cluster galaxies is not supported. 

We have compared the properties of our clusters with the 
prediction of the hierarchical models of structure formation. 
These models naturally predict that oc with y < 1 . This 
result is generally interpreted as the indication that the galaxies 
in the low mass systems are older and more luminous per unit 
mass than the galaxies in high mass clusters. As a consequence, 
variations of the shape of the cluster LF and of the elliptical FP 
with the cluster mass are also expected. Such predicted varia- 
tions are however not seen in our data. Not only we found the 
LF to be the same for clusters of different masses, but we also 
proved that this universal LF can be used to accurately predict 
the magnitudes of the three brightest cluster galaxies, given the 
LF-normalization of the clusters in which they are located. In 
other words, the BCG magnitudes are consistent with being 
drawn from the best-fit magnitude distribution of other cluster 
galaxies. Moreover we have shown that the FP of cluster ellip- 
ticals has the same slope in all the clusters and does not depend 
on the cluster mass. 

We conclude this paper with the following considerations. 
From the observational point of view, the mean cluster lumi- 
nosity function and the N - M or the L - M relation determine 
completely the luminosity distribution of cluster galaxies. The 



mean cluster LF constrains with high accuracy the shape of the 
luminosity distribution in clusters, while the N - M relation, 
calculated in a given magnitude range, fixes the normalization 
of the LF as a function of the cluster mass. Forthcoming cosmo- 
logical models of galaxy formation should aim at reproducing 
this characteristic of the cluster galaxy populations, in order to 
understand the processes of galaxy formation and evolution in 
the cluster enviroment. 
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